Exploring Uncertainty Measures for Image-caption Embedding-and-retrieval Task
نویسندگان
چکیده
With the significant development of black-box machine learning algorithms, particularly deep neural networks, practical demand for reliability assessment is rapidly increasing. On basis concept that “Bayesian knows what it does not know,” uncertainty network outputs has been investigated as a measure classification and regression tasks. By considering an embedding task task, several existing studies have quantified embedded features improved retrieval performance cutting-edge models by model averaging. However, in image-caption embedding-and-retrieval tasks, well-known samples are always easy to retrieve. This study shows method poor investigates another aspect We propose posterior which can accurately assess results. The consistent two measures observed with different datasets (MS-COCO Flickr30k), deep-learning architectures (dropout batch normalization), similarity functions. To best our knowledge, this first perform on
منابع مشابه
evaluation of dissimilarity measures for image retrieval and classification
in this paper, the performance of 11 different distances for image retrieval and classification, based on color, shape and texture, is evaluated. the precision-recall measure and the correct classification rate of the k-nn classifier are used to evaluate retrieval and classification performances, respectively. the experimental results for a database of 1000 images from 10 different semantic gro...
متن کاملCaption vs. Query Translation for Cross-Language Image Retrieval
For many cross-language retrieval tasks, the predominant approach is to translate the query into the language of the document collection (target language). This often gives results as good as, if not better, than translating the document collection into the query language (source language). In this paper, we evaluate query versus document translation for the ImageCLEF 2004 bilingual ad hoc retr...
متن کاملShape measures for image retrieval
One of the main goals in Content Based Image Retrieval (CBIR) is to incorporate shape into the process in a reliable manner. In order to overcome the difficulties of directly obtaining shape information (in particular avoiding region segmentation) we develop several shape measures that tackle the problem in an indirect manner, requiring only a minimal amount of segmentation. A histogram-based s...
متن کاملISIA at the ImageCLEF 2017 Image Caption Task
This paper describes the details of our methods for participation in the caption prediction task of ImageCLEF 2017. The dataset we use is all provided by the organizers and doesn’t include any external resources. The key components of our framework include a deep model part, an SVM part and a caption retrieval part. In deep model part, we use an end to end architecture with Convolutional neural...
متن کامل» research note « evaluation of dissimilarity measures for image retrieval and classification
in this paper, the performance of 11 different distances for image retrieval and classification, based on color, shape and texture, is evaluated. the precision-recall measure and the correct classification rate of the k-nn classifier are used to evaluate retrieval and classification performances, respectively. the experimental results for a database of 1000 images from 10 different semantic gro...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: ACM Transactions on Multimedia Computing, Communications, and Applications
سال: 2021
ISSN: ['1551-6857', '1551-6865']
DOI: https://doi.org/10.1145/3425663